SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: SOLVAY (Societe Anonyme) 

(B) STREET: rue du Prince Albert, 33

(C) CITY: Brussels 

(E) COUNTRY: Belgium 

(F) POSTAL CODE: 1050 

(G) TELEPHONE: (02) 509.61.11 

(ii) TITLE OF INVENTION: Xylanase, microorganisms
producing it, DNA molecule [sic], methods for preparing
this xylanase and uses of the latter

(iii) NUMBER OF SEQUENCES: 29 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



CAAATCGTCA CCGACAATTC CATTGGCAAC CACGATGGCT ATGATTATGA ATTTTGGAAA 60
GATAGCGGTG GCTCTGGGAC AATGATTCTC AATCATGGCG GTACGTTCAG TGCCCAATGG 120
AACAATGTTA ACAACATATT ATTCCGTAAA GGTAAAAAAT TCAATGAAAC ACAAACACAC 180
CAACAAGTTG GTAACATGTC CATAAACTAC GGAGCCAACT TCCAACCAAA TGGTAATGCG 240
TATTTATGCG TCTATGGTTG GACTGTTGAC CCTCTTGTCG AATATTATAT TGTCGACAGT 300
TGGGGCAACT GGCGTCCACC AGGAGCAACG CCTAAGGGGA CCATCACTGT TGATGGAGGA 360
ACATATGATA TCTACGAGAC TCTTAGAGTC AATCAACCCT CCATTAAGGG GATTGCCACA 420
TTTAAACAAT ATTGGAGTGT TCGAAGATCG AAACGCACGA GTGGCACGAT TTCTGTCAGC 480
AACCACTTTA GAGCGTGGGA AAACTTAGGG ATGAATATGG GGAAAATGTA TGAAGTCGCG 540
CTTACTGTAG AAGGCTATCA AAGTAGCGGA AGTGCTAATG TATATAGCAA TACACTAAGA 600
ATTAACGGTA ACCCTCTCTC AACTATTAGT AATGACGAGA GCATAACTTT GGATAAAAAC 660
AAT 663

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CAA ATC GTC ACC GAC AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT
Gln Ile Val Thr Asp Asn Ser Ile Gly Asn His Asp Gly Tyr Asp Tyr
1 5 10 15

GAA TTT TGG AAA GAT AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT
Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met Ile Leu Asn His
20 25 30

GGC GGT ACG TTC AGT GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC
Gly Gly Thr Phe Ser Ala Gln Trp Asn Asn Val Asn Asn Ile Leu Phe
35 40 45

CGT AAA GGT AAA AAA TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT
Arg Lys Gly Lys Lys Phe Asn Glu Thr Gln Thr His Gln Gln Val Gly
50 55 60

AAC ATG TCC ATA AAC TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG
Asn Met Ser Ile Asn Tyr Gly Ala Asn Phe Gln Pro Asn Gly Asn Ala
65 70 75 80

TAT TTA TGC GTC TAT GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT
Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr
85 90 95

ATT GTC GAC AGT TGG GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG
Ile Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys
100 105 110

GGG ACC ATC ACT GTT GAT GGA GGA ACA TAT GAT ATC TAC GAG ACT CTT
Gly Thr Ile Thr Val Asp Gly Gly Thr Tyr Asp Ile Tyr Glu Thr Leu
115 120 125

AGA GTC AAT CAA CCC TCC ATT AAG GGG ATT GCC ACA TTT AAA CAA TAT
Arg Val Asn Gln Pro Ser Ile Lys Gly Ile Ala Thr Phe Lys Gln Tyr
130 135 140

TGG AGT GTT CGA AGA TCG AAA CGC ACG AGT GGC ACG ATT TCT GTC AGC
Trp Ser Val Arg Arg Ser Lys Arg Thr Ser Gly Thr Ile Ser Val Ser
145 150 155 160

AAC CAC TTT AGA GCG TGG GAA AAC TTA GGG ATG AAT ATG GGG AAA ATG
Asn His Phe Arg Ala Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met
165 170 175

TAT GAA GTC GCG CTT ACT GTA GAA GGC TAT CAA AGT AGC GGA AGT GCT
Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gln Ser Ser Gly Ser Ala
180 185 190

AAT GTA TAT AGC AAT ACA CTA AGA ATT AAC GGT AAC CCT CTC TCA ACT
Asn Val Tyr Ser Asn Thr Leu Arg Ile Asn Gly Asn Pro Leu Ser Thr
195 200 205

ATT AGT AAT GAC GAG AGC ATA ACT TTG GAT AAA AAC AAT
Ile Ser Asn Asp Glu Ser Ile Thr Leu Asp Lys Asn Asn
210 215 220

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 221 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Bacillus

(B) STRAIN: 720/1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Gln Ile Val Thr Asp Asn Ser Ile Gly Asn His Asp Gly Tyr Asp Tyr
1 5 10 15

Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met Ile Leu Asn His
20 25 30

Gly Gly Thr Phe Ser Ala Gln Trp Asn Asn Val Asn Asn Ile Leu Phe
35 40 45

Arg Lys Gly Lys Lys Phe Asn Glu Thr Gln Thr His Gln Gln Val Gly
50 55 60

Asn Met Ser Ile Asn Tyr Gly Ala Asn Phe Gln Pro Asn Gly Asn Ala
65 70 75 80

Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr
85 90 95

Ile Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys
100 105 110

Gly Thr Ile Thr Val Asp Gly Gly Thr Tyr Asp Ile Tyr Glu Thr Leu
115 120 125

Arg Val Asn Gln Pro Ser Ile Lys Gly Ile Ala Thr Phe Lys Gln Tyr
130 135 140

Trp Ser Val Arg Arg Ser Lys Arg Thr Ser Gly Thr Ile Ser Val Ser
145 150 155 160

Asn His Phe Arg Ala Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met
165 170 175

Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gln Ser Ser Gly Ser Ala
180 185 190

Asn Val Tyr Ser Asn Thr Leu Arg Ile Asn Gly Asn Pro Leu Ser Thr
195 200 205

Ile Ser Asn Asp Glu Ser Ile Thr Leu Asp Lys Asn Asn
210 215 220

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 744 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

ATGAGACAAA AGAAATTGAC GTTGATTTTA GCCTTTTTAG TTTGTTTTGC ACTAACCTTA 60

CCTGCAGAAA TAATTCAGGC ACAAATCGTC ACCGACAATT CCATTGGCAA CCACGATGGC 120

TATGATTATG AATTTTGGAA AGATAGCGGT GGCTCTGGGA CAATGATTCT CAATCATGGC 180

GGTACGTTCA GTGCCCAATG GAACAATGTT AACAACATAT TATTCCGTAA AGGTAAAAAA 240

TTCAATGAAA CACAAACACA CCAACAAGTT GGTAACATGT CCATAAACTA CGGAGCCAAC 300

TTCCAACCAA ATGGTAATGC GTATTTATGC GTCTATGGTT GGACTGTTGA CCCTCTTGTC 360

GAATATTATA TTGTCGACAG TTGGGGCAAC TGGCGTCCAC CAGGAGCAAC GCCTAAGGGG 420

ACCATCACTG TTGATGGAGG AACATATGAT ATCTACGAGA CTCTTAGAGT CAATCAACCC 480

TCCATTAAGG GGATTGCCAC ATTTAAACAA TATTGGAGTG TTCGAAGATC GAAACGCACG 540

AGTGGCACGA TTTCTGTCAG CAACCACTTT AGAGCGTGGG AAAACTTAGG GATGAATATG 600

GGGAAAATGT ATGAAGTCGC GCTTACTGTA GAAGGCTATC AAAGTAGCGG AAGTGCTAAT 660

GTATATAGCA ATACACTAAG AATTAACGGT AACCCTCTCT CAACTATTAG TAATGACGAG 720

AGCATAACTT TGGATAAAAA CAAT 744

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 744 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC TTT TTA GTT TGT TTT
Met Arg Gln Lys Lys Leu Thr Leu Ile Leu Ala Phe Leu Val Cys Phe
-25 -20 -15

GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA CAA ATC GTC ACC GAC
Ala Leu Thr Leu Pro Ala Glu Ile Ile Gln Ala Gln Ile Val Thr Asp
-10 -5 -1 1 5

AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT GAA TTT TGG AAA GAT
Asn Ser Ile Gly Asn His Asp Gly Tyr Asp Tyr Glu Phe Trp Lys Asp
10 15 20

AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT GGC GGT ACG TTC AGT
Ser Gly Gly Ser Gly Thr Met Ile Leu Asn His Gly Gly Thr Phe Ser
25 30 35

GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC CGT AAA GGT AAA AAA
Ala Gln Trp Asn Asn Val Asn Asn Ile Leu Phe Arg Lys Gly Lys Lys
40 45 50

TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT AAC ATG TCC ATA AAC
Phe Asn Glu Thr Gln Thr His Gln Gln Val Gly Asn Met Ser Ile Asn
55 60 65

TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG TAT TTA TGC GTC TAT
Tyr Gly Ala Asn Phe Gln Pro Asn Gly Asn Ala Tyr Leu Cys Val Tyr
70 75 80 85

GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT ATT GTC GAC AGT TGG
Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr Ile Val Asp Ser Trp
90 95 100

GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG GGG ACC ATC ACT GTT
Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys Gly Thr Ile Thr Val
105 110 115

GAT GGA GGA ACA TAT GAT ATC TAC GAG ACT CTT AGA GTC AAT CAA CCC
Asp Gly Gly Thr Tyr Asp Ile Tyr Glu Thr Leu Arg Val Asn Gln Pro
120 125 130

TCC ATT AAG GGG ATT GCC ACA TTT AAA CAA TAT TGG AGT GTT CGA AGA
Ser Ile Lys Gly Ile Ala Thr Phe Lys Gln Tyr Trp Ser Val Arg Arg
135 140 145

TCG AAA CGC ACG AGT GGC ACG ATT TCT GTC AGC AAC CAC TTT AGA GCG
Ser Lys Arg Thr Ser Gly Thr Ile Ser Val Ser Asn His Phe Arg Ala
150 155 160 165

TGG GAA AAC TTA GGG ATG AAT ATG GGG AAA ATG TAT GAA GTC GCG CTT
Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met Tyr Glu Val Ala Leu
170 175 180

ACT GTA GAA GGC TAT CAA AGT AGC GGA AGT GCT AAT GTA TAT AGC AAT
Thr Val Glu Gly Tyr Gln Ser Ser Gly Ser Ala Asn Val Tyr Ser Asn
185 190 195

ACA CTA AGA ATT AAC GGT AAC CCT CTC TCA ACT ATT AGT AAT GAC GAG
Thr Leu Arg Ile Asn Gly Asn Pro Leu Ser Thr Ile Ser Asn Asp Glu
200 205 210

AGC ATA ACT TTG GAT AAA AAC AAT
Ser Ile Thr Leu Asp Lys Asn Asn
215 220

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 248 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Bacillus

(B) SOURCE: 720/1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Arg Gln Lys Lys Leu Thr Leu Ile Leu Ala Phe Leu Val Cys Phe
-25 -20 -15

Ala Leu Thr Leu Pro Ala Glu Ile Ile Gln Ala Gln Ile Val Thr Asp
-10 -5 -1 1 5

Asn Ser Ile Gly Asn His Asp Gly Tyr Asp Tyr Glu Phe Trp Lys Asp
10 15 20

Ser Gly Gly Ser Gly Thr Met Ile Leu Asn His Gly Gly Thr Phe Ser
25 30 35

Ala Gln Trp Asn Asn Val Asn Asn Ile Leu Phe Arg Lys Gly Lys Lys
40 45 50

Phe Asn Glu Thr Gln Thr His Gln Gln Val Gly Asn Met Ser Ile Asn
55 60 65

Tyr Gly Ala Asn Phe Gln Pro Asn Gly Asn Ala Tyr Leu Cys Val Tyr
70 75 80 85

Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr Ile Val Asp Ser Trp
90 95 100

Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys Gly Thr Ile Thr Val
105 110 115

Asp Gly Gly Thr Tyr Asp Ile Tyr Glu Thr Leu Arg Val Asn Gln Pro
120 125 130

Ser Ile Lys Gly Ile Ala Thr Phe Lys Gln Tyr Trp Ser Val Arg Arg
135 140 145

Ser Lys Arg Thr Ser Gly Thr Ile Ser Val Ser Asn His Phe Arg Ala
150 155 160 165

Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met Tyr Glu Val Ala Leu
170 175 180

Thr Val Glu Gly Tyr Gln Ser Ser Gly Ser Ala Asn Val Tyr Ser Asn
185 190 195

Thr Leu Arg Ile Asn Gly Asn Pro Leu Ser Thr Ile Ser Asn Asp Glu
200 205 210

Ser Ile Thr Leu Asp Lys Asn Asn
215 220

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATGAGACAAA AGAAATTGAC GTTGATTTTA GCCTTTTTAG TTTGTTTTGC ACTAACCTTA 60
CCTGCAGAAA TAATTCAGGC A 81

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC TTT TTA GTT TGT TTT 48
Met Arg Gln Lys Lys Leu Thr Leu Ile Leu Ala Phe Leu Val Cys Phe
1 5 10 15

GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA 81
Ala Leu Thr Leu Pro Ala Glu Ile Ile Gln Ala
20 25

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Bacillus

(B) SOURCE: 720/1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Arg Gln Lys Lys Leu Thr Leu Ile Leu Ala Phe Leu Val Cys Phe
1 5 10 15

Ala Leu Thr Leu Pro Ala Glu Ile Ile Gln Ala
20 25
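
As a cross-check on the two preceding entries, the 81-base presequence of SEQ ID NO: 8 can be translated with the standard genetic code and compared with the 27-residue peptide of SEQ ID NO: 9. The sketch below is illustrative only: the function name `translate`, the constant names and the use of one-letter amino acid codes are assumptions of this example and are not part of the listing.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding strand (5'->3') into one-letter amino acid codes."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the listing
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Presequence as printed in SEQ ID NO: 8 (codon spacing kept for readability).
SEQ_ID_8 = (
    "ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC TTT TTA GTT TGT TTT "
    "GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA"
)

peptide = translate(SEQ_ID_8)
print(len(peptide), peptide)
```

The one-letter result corresponds residue for residue to the three-letter peptide of SEQ ID NO: 9 (M = Met, R = Arg, Q = Gln, and so on).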

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1513 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

AAATTGAATT GTGTATATCT AATGATAACG ACAAATCGTC ACTGTTTTTA AACTAATCTC 60

AAACCAATAC TTCTTTATTT AACGCTAACC ACTTGCAATC TTATCACAAG AACATTCTTT 120

ATAGGAACTT TCCCATTTGC AAGACGATAA AAAATCTTTT TCCCCTATTT TATCTTATCG 180

CCTTGATCGG TTTAATTTGT AAACTTTATT TTAGTTTACG TGATGTTCCC TCATTCATAC 240

CATTAATCAC AGTTAACGCT AGAGTCATCT TTTTTCGGTT CTCAAAAATA CCTGAAGAAC 300

ATTTATGTCA TATTTTCTCA CGCCGCTCCA TAATGGAATA TATATACTCT TTTATACATA 360

TTAAGTAAAT TAGTATATAC TTGCGTTATC AAAATGTGAG ATAATCTAAT TGATCAAACA 420

AGCAGCTATC CAAAAAACAC TGATGTTGAC CTCTTAAAGA AGTGTCACTA TCTATGAAAA 480

GATAATTATC CAGTTTCAAA ATTTGAAATA GTGTGTATGG AATAGTTTGA ATGTCAACTG 540

CTGTGAAAGG AGGGTAGGTA GTACCGTAGA CTTCATTACC AAAAATTAGT TGTAAAAAAA 600

TTAAAAGGAG GAATGCCTAA TGAGACAAAA GAAATTGACG TTGATTTTAG CCTTTTTAGT 660

TTGTTTTGCA CTAACCTTAC CTGCAGAAAT AATTCAGGCA CAAATCGTCA CCGACAATTC 720

CATTGGCAAC CACGATGGCT ATGATTATGA ATTTTGGAAA GATAGCGGTG GCTCTGGGAC 780

AATGATTCTC AATCATGGCG GTACGTTCAG TGCCCAATGG AACAATGTTA ACAACATATT 840

ATTCCGTAAA GGTAAAAAAT TCAATGAAAC ACAAACACAC CAACAAGTTG GTAACATGTC 900

CATAAACTAC GGAGCCAACT TCCAACCAAA TGGTAATGCG TATTTATGCG TCTATGGTTG 960

GACTGTTGAC CCTCTTGTCG AATATTATAT TGTCGACAGT TGGGGCAACT GGCGTCCACC 1020

AGGAGCAACG CCTAAGGGGA CCATCACTGT TGATGGAGGA ACATATGATA TCTACGAGAC 1080

TCTTAGAGTC AATCAACCCT CCATTAAGGG GATTGCCACA TTTAAACAAT ATTGGAGTGT 1140

TCGAAGATCG AAACGCACGA GTGGCACGAT TTCTGTCAGC AACCACTTTA GAGCGTGGGA 1200

AAACTTAGGG ATGAATATGG GGAAAATGTA TGAAGTCGCG CTTACTGTAG AAGGCTATCA 1260

AAGTAGCGGA AGTGCTAATG TATATAGCAA TACACTAAGA ATTAACGGTA ACCCTCTCTC 1320

AACTATTAGT AATGACGAGA GCATAACTTT GGATAAAAAC AATTAAAAAT CCTTATCTCT 1380

TTCGGTTCAG TTCTCATTAT TTTCAAATAA CCTCCCGGTT GGATCTTTTC CAACGGGAGG 1440

TTTTATTGGA AAGGTTAAGT ATAGTATACT CCGATTCCAT CCAGAGGAAT GCTTGAAACA 1500

CCTCCGTCAC TAG 1513

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1513 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

AAATTGAATT GTGTATATCT AATGATAACG ACAAATCGTC ACTGTTTTTA AACTAATCTC 60

AAACCAATAC TTCTTTATTT AACGCTAACC ACTTGCAATC TTATCACAAG AACATTCTTT 120

ATAGGAACTT TCCCATTTGC AAGACGATAA AAAATCTTTT TCCCCTATTT TATCTTATCG 180

CCTTGATCGG TTTAATTTGT AAACTTTATT TTAGTTTACG TGATGTTCCC TCATTCATAC 240

CATTAATCAC AGTTAACGCT AGAGTCATCT TTTTTCGGTT CTCAAAAATA CCTGAAGAAC 300

ATTTATGTCA TATTTTCTCA CGCCGCTCCA TAATGGAATA TATATACTCT TTTATACATA 360

TTAAGTAAAT TAGTATATAC TTGCGTTATC AAAATGTGAG ATAATCTAAT TGATCAAACA 420

AGCAGCTATC CAAAAAACAC TGATGTTGAC CTCTTAAAGA AGTGTCACTA TCTATGAAAA 480

GATAATTATC CAGTTTCAAA ATTTGAAATA GTGTGTATGG AATAGTTTGA ATGTCAACTG 540

CTGTGAAAGG AGGGTAGGTA GTACCGTAGA CTTCATTACC AAAAATTAGT TGTAAAAAAA 600

TTAAAAGGAG GAATGCCTA ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC 652
                     Met Arg Gln Lys Lys Leu Thr Leu Ile Leu Ala
                     -25 -20

TTT TTA GTT TGT TTT GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA 700
Phe Leu Val Cys Phe Ala Leu Thr Leu Pro Ala Glu Ile Ile Gln Ala
-15 -10 -5 -1

CAA ATC GTC ACC GAC AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT 748
Gln Ile Val Thr Asp Asn Ser Ile Gly Asn His Asp Gly Tyr Asp Tyr
1 5 10 15

GAA TTT TGG AAA GAT AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT 796
Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met Ile Leu Asn His
20 25 30

GGC GGT ACG TTC AGT GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC 844
Gly Gly Thr Phe Ser Ala Gln Trp Asn Asn Val Asn Asn Ile Leu Phe
35 40 45

CGT AAA GGT AAA AAA TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT 892
Arg Lys Gly Lys Lys Phe Asn Glu Thr Gln Thr His Gln Gln Val Gly
50 55 60

AAC ATG TCC ATA AAC TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG 940
Asn Met Ser Ile Asn Tyr Gly Ala Asn Phe Gln Pro Asn Gly Asn Ala
65 70 75 80

TAT TTA TGC GTC TAT GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT 988
Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr
85 90 95

ATT GTC GAC AGT TGG GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG 1036
Ile Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys
100 105 110

GGG ACC ATC ACT GTT GAT GGA GGA ACA TAT GAT ATC TAC GAG ACT CTT 1084
Gly Thr Ile Thr Val Asp Gly Gly Thr Tyr Asp Ile Tyr Glu Thr Leu
115 120 125

AGA GTC AAT CAA CCC TCC ATT AAG GGG ATT GCC ACA TTT AAA CAA TAT 1132
Arg Val Asn Gln Pro Ser Ile Lys Gly Ile Ala Thr Phe Lys Gln Tyr
130 135 140

TGG AGT GTT CGA AGA TCG AAA CGC ACG AGT GGC ACG ATT TCT GTC AGC 1180
Trp Ser Val Arg Arg Ser Lys Arg Thr Ser Gly Thr Ile Ser Val Ser
145 150 155 160

AAC CAC TTT AGA GCG TGG GAA AAC TTA GGG ATG AAT ATG GGG AAA ATG 1228
Asn His Phe Arg Ala Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met
165 170 175

TAT GAA GTC GCG CTT ACT GTA GAA GGC TAT CAA AGT AGC GGA AGT GCT 1276
Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gln Ser Ser Gly Ser Ala
180 185 190

AAT GTA TAT AGC AAT ACA CTA AGA ATT AAC GGT AAC CCT CTC TCA ACT 1324
Asn Val Tyr Ser Asn Thr Leu Arg Ile Asn Gly Asn Pro Leu Ser Thr
195 200 205

ATT AGT AAT GAC GAG AGC ATA ACT TTG GAT AAA AAC AAT TAAAAATCCT 1373
Ile Ser Asn Asp Glu Ser Ile Thr Leu Asp Lys Asn Asn
210 215 220

TATCTCTTTC GGTTCAGTTC TCATTATTTT CAAATAACCT CCCGGTTGGA TCTTTTCCAA 1433
CGGGAGGTTT TATTGGAAAG GTTAAGTATA GTATACTCCG ATTCCATCCA GAGGAATGCT 1493
TGAAACACCT CCGTCACTAG 1513



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 619 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

AAATTGAATT GTGTATATCT AATGATAACG ACAAATCGTC ACTGTTTTTA AACTAATCTC 60

AAACCAATAC TTCTTTATTT AACGCTAACC ACTTGCAATC TTATCACAAG AACATTCTTT 120

ATAGGAACTT TCCCATTTGC AAGACGATAA AAAATCTTTT TCCCCTATTT TATCTTATCG 180

CCTTGATCGG TTTAATTTGT AAACTTTATT TTAGTTTACG TGATGTTCCC TCATTCATAC 240

CATTAATCAC AGTTAACGCT AGAGTCATCT TTTTTCGGTT CTCAAAAATA CCTGAAGAAC 300

ATTTATGTCA TATTTTCTCA CGCCGCTCCA TAATGGAATA TATATACTCT TTTATACATA 360

TTAAGTAAAT TAGTATATAC TTGCGTTATC AAAATGTGAG ATAATCTAAT TGATCAAACA 420

AGCAGCTATC CAAAAAACAC TGATGTTGAC CTCTTAAAGA AGTGTCACTA TCTATGAAAA 480

GATAATTATC CAGTTTCAAA ATTTGAAATA GTGTGTATGG AATAGTTTGA ATGTCAACTG 540

CTGTGAAAGG AGGGTAGGTA GTACCGTAGA CTTCATTACC AAAAATTAGT TGTAAAAAAA 600

TTAAAAGGAG GAATGCCTA 619



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TAAAAATCCT TATCTCTTTC GGTTCAGTTC TCATTATTTT CAAATAACCT CCCGGTTGGA 60
TCTTTTCCAA CGGGAGGTTT TATTGGAAAG GTTAAGTATA GTATACTCCG ATTCCATCCA 120
GAGGAATGCT TGAAACACCT CCGTCACTAG 150
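
The lengths stated in the entries so far can be checked against one another. The sketch below assumes, from the sequences as printed, that SEQ ID NO: 10 is the upstream region (SEQ ID NO: 12), the precursor coding sequence (SEQ ID NO: 4) and the downstream region (SEQ ID NO: 13) joined end to end, and that SEQ ID NO: 4 is the presequence (SEQ ID NO: 7) followed by the mature coding sequence (SEQ ID NO: 1). The dictionary and function names are illustrative and not part of the listing.

```python
# Lengths as stated in the SEQUENCE CHARACTERISTICS fields of the listing.
LENGTH_BP = {1: 663, 4: 744, 7: 81, 10: 1513, 12: 619, 13: 150}  # base pairs
LENGTH_AA = {3: 221, 6: 248, 9: 27}                               # amino acids


def check_lengths() -> dict:
    """Return named consistency checks; every value should be True."""
    bp, aa = LENGTH_BP, LENGTH_AA
    return {
        # Assumed layout: presequence + mature coding sequence = precursor.
        "7 + 1 = 4 (bp)": bp[7] + bp[1] == bp[4],
        # Assumed layout: upstream + precursor + downstream = full fragment.
        "12 + 4 + 13 = 10 (bp)": bp[12] + bp[4] + bp[13] == bp[10],
        # Three bases per codon; the stop codon is not part of these entries.
        "1 encodes 3": bp[1] == 3 * aa[3],
        "4 encodes 6": bp[4] == 3 * aa[6],
        "7 encodes 9": bp[7] == 3 * aa[9],
        # Signal peptide + mature protein = precursor protein.
        "9 + 3 = 6 (aa)": aa[9] + aa[3] == aa[6],
    }


for name, ok in check_lengths().items():
    print(f"SEQ ID NO {name}: {'consistent' if ok else 'MISMATCH'}")
```

A mismatch in any of these checks would point to a transcription error in a LENGTH field rather than in the sequence itself, since the checks use only the stated lengths.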



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

CCCCCCTACG TAGCGGCCGC CCCGGCCGGT AACCTAGGAA GTCAGCGCCC TGCACC 56

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

CCCCCCTACG TAGGCCGGGG CGGCCGCGGT TACCTAGGGC CTCGTGATAC GCCTAT 56

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

ACGAGGAAAG ATGCTGTTCT TGTAAATGAG T 31



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

TACCTTGTCT ACAAACCCC 19

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

CGGTCGCCGC ATACACTA 18

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CCCCCCCCCG GTAACCTGCA TTAATGAATC GGCCAA 36

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

CCCCCCCCCG GTTACCGTAT TTATTAACTT CTCCTAGTA 39



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

CCCCCCTCTA GATTAATTAA CCAAGCTTGG GATCCGTCGA CCTGCAGATC 50

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

CCCCCCTGAA ATCAGCTGGA CTAAAAGGGA TGCAATTTC 39

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

CCCCCCGTCG ACCGCATGCG CCGGCACAGC 30

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

CCCCCCGCAT GCGCAAATCG TCACCGACAA TTCCATTGG 39

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid (synthetic oligonucleotide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

TACCTTGTCT ACAAACCCC 19



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 185 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

TCATGTAACT CGCCTTGATC TATTTCATTT GTATCAAAGG ATTTATACAC AAACAAGAGA 60
CATCCATGCC GGGTTAAAGC AGTATCGTTC CATCTAACAG AGAAGGNCTG CATGAAAGGA 120
GGTGATGGGT TTTTCATCTT AGGGATGACA GAACAATACG GATGAAAAAA GGAGAGGGAT 180
GGAAA 185

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

ATGAATTTGA AAAGATTGAG GCTGTTGTTT GTGATGTGTA TTGGATTTGT GCTGACACTG 60
ACGGCTGTGC CGGCTCATGC G 81

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

ATG AAT TTG AAA AGA TTG AGG CTG TTG TTT GTG ATG TGT ATT GGA TTT 48
Met Asn Leu Lys Arg Leu Arg Leu Leu Phe Val Met Cys Ile Gly Phe
1 5 10 15

GTG CTG ACA CTG ACG GCT GTG CCG GCT CAT GCG 81
Val Leu Thr Leu Thr Ala Val Pro Ala His Ala
20 25

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Asn Leu Lys Arg Leu Arg Leu Leu Phe Val Met Cys Ile Gly Phe
1 5 10 15

Val Leu Thr Leu Thr Ala Val Pro Ala His Ala
20 25



